Variable Resolution Hierarchical RL
نویسنده
چکیده
The contribution of this paper is to introduce heuristics, that go beyond safe state abstraction in hierarchical reinforcement learning, to approximate a decomposed value function. Additional improvements in time and space complexity for learning and execution may outweigh achieving less than hierarchically optimal performance and deliver anytime decision making during execution. Heuristics are discussed in relation to HEXQ, a MDP partitioning that generates a hierarchy of abstract models using safe state abstraction. The approximation methods are illustrated empirically.
منابع مشابه
Efficient Reinforcement Learning in Continuous Environments
Reinforcement Learning (RL) is a machine learning paradigm with which autonomous agents can improve their behavior in unknown environments based on their own experience without an explicit teacher signal. RL algorithms are based on estimating a value function over the state space, and scaling them to large state spaces remains a challenge. One approach, known as variable resolution, is to focus...
متن کاملHierarchical Policy Search via Return-Weighted Density Estimation
Learning an optimal policy from a multi-modal reward function is a challenging problem in reinforcement learning (RL). Hierarchical RL (HRL) tackles this problem by learning a hierarchical policy, where multiple option policies are in charge of different strategies corresponding to modes of a reward function and a gating policy selects the best option for a given context. Although HRL has been ...
متن کاملArgumentation accelerated reinforcement learning
Reinforcement Learning (RL) is a popular statistical Artificial Intelligence (AI) technique for building autonomous agents, but it suffers from the curse of dimensionality: the computational requirement for obtaining the optimal policies grows exponentially with the size of the state space. Integrating heuristics into RL has proven to be an effective approach to combat this curse, but deriving ...
متن کاملReinforcement Learning Hierarchical Neuro-Fuzzy Politree Model for Control of Autonomous Agents
This work presents a new hybrid neuro-fuzzy model for automatic learning of actions taken by agents. The main objective of this new model is to provide an agent with intelligence, making it capable, by interacting with its environment, to acquire and retain knowledge for reasoning (infer an action). This new model, named Reinforcement Learning Hierarchical Neuro-Fuzzy Politree (RL-HNFP), and it...
متن کاملProactive and Adaptive Data Migration in Hierarchical Storage Systems using Reinforcement Learning Agent
With the data generation rates growing exponentially, businesses are having a difficult time maintaining data center infrastructure. Hierarchical storage systems has evolved as a better alternate to managing data, as frequently accessed data is placed on higher tiers and the least frequently accessed data on lower tiers. But the data arrangement is not always static. Data Migration is an operat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003